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The transmissions as functions of energy are central for electron or phonon transport in the 
Landauer transport picture. We suggest a simple and computationally “cheap” post-processing 
scheme to interpolate transmission functions over fc-points to get smooth well-converged average 
transmission functions. This is relevant for data obtained using typical “expensive” first principles 
calculations where the leads/electrodes are described by periodic boundary conditions. We show 
examples of transport in graphene structures where a speed-up of an order of magnitude is easily 
obtained. 


I. INTRODUCTION 

Calculations of electronic conductance based on first 
principle methods such as density functional theory 
(DFT) provide a valuable tool in order to gain insights 
into electronic transport in nano-conductors and compar¬ 
ison to experiments without employing fitting parame¬ 
ters. This is for example the case in the field of single- 
molecular device^ 2 . Popular methods are based on DFT 
in combination with the non-equilibrium Green’s func¬ 
tion approach (DFT-NEGF), see e.g. [I, T2| and fl5, or 
scattering wave-function approaches^*. The electrodes in 
such calculations are typically treated employing peri¬ 
odic boundary conditions in the direction transverse to 
the transport direction with a corresponding Appoint av¬ 
erage of the electronic states and transmissions. This 
means that for each transverse fc-point the system essen¬ 
tially behaves as a one-dimensional conductor with di¬ 
verging density of states and discontinuities in the trans¬ 
mission function at energies corresponding to band on- 
sets/channel openings. It is well known that often in 
order to obtain smooth, well-converged density of states 
and transmissions as a function of energy, a substantial 
number of transverse fc-points are needed due to the rapid 
variations of these functions for individual fc-points. Cer¬ 
tain quantities, for example the Seebeck coefficient and 
thermo-electric figure of merit (ZT) are based on the de¬ 
tailed behavior of the transmission^ 1 ^ and thus exceed¬ 
ingly sensitive to energy and fc-resolution of the calcula¬ 
tions. This can amount to a significant computational 
burden for large systems treated by first principle meth¬ 
ods. Thus it is highly interesting to devise simple ways to 
make this more efficient and get maximum information 
from the data. 

Sophisticated methods to tackle this include the trans¬ 
formation to a smaller basis-set using maximally localized 
Wannier function^, or to construct a optimized minimal 
basis-set and using this to determine the transmission 1 ^. 
Both require one to examine the details of the chemical 
bonds in the system, relevant energy windows, and stor¬ 
ing wavefunctions, which tend to be elaborate. In this 
paper we present a simple and efficient post-processing 


interpolation scheme which can significantly speed up the 
convergence with respect to fc-points. We illustrate the 
method by applying it to various graphene-based nano¬ 
structures which are prone to bad convergence due to its 
vanishing density of states at the Fermi level. 

In the remaining parts of the paper we first explain 
the workings of the interpolation scheme in Sec. [TTJ while 
we investigate various test cases in Sec. m and finally 
discuss limitations to the scheme and conclude in Sec. ED 




Figure 1. Analysis of transmission in pristine graphene: (a) 
Transmission converges as the number of transverse fc-points 
increases (darker colors), (b) The mean absolute deviation 
from fully converged data shown for TranSiesta data before 
and after using our interpolation scheme. 
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(a) Data transformation 


(b) Matrix construction 


(c) Interpolation 


Figure 2. (Colour online) We seek an interpolation of the two datasets (Red, Blue) containing Ni and N 2 points, respectively. 
The three steps in the algorithm: (a) The original data sets are transformed into line segments of a maximum length £ = 0.1. 
Thus, for example the line segment a-c is split into 50 points, (b) The weighted Euclidean distance is calculated for all point 
on both curves, and the shortest path from (1,1) to (Ni, N 2 ) is found, (c) The curves are linearly interpolated using the found 
path with N = 10 intermediate curves. 


II. DESCRIPTION OF THE METHOD 

The use of computationally “expensive” first principles 
DFT-NEGF calculations for determining the transmis¬ 
sion through nano-structured systems is limited by the 
amount of time one can afford to spend on the fc-grid 
resolution. Often the rough behavior of the transmission 
is already seen with a limited number of fc-points but 
the convergence of the average is slow since the functions 
are changing abruptly with energy, e. g. around a band 
onset or a resonance. The position of the abrupt fea¬ 
ture will typically shift in a smooth way with changing 
fc-point, but a linear interpolation of the curves between 
two consecutive fc-points will be of little use since it will 
simply contain, say, half of each abrupt feature. Instead, 
we propose an interpolation scheme which can make use 
of a coarse, non-converged fc-grid, and thus reduce the 
computation cost simply by using a “clever” technique 
to approximate the transmission curves for intermediate 
fc-points. The method does not magically guess the cor¬ 
rect interpolated curves, and one has to have a reasonable 
amount of fc-point resolved transmission curves in order 
to obtain a useful result, but smooth averaged curves can 
be obtained, as we will illustrate below, using a signifi¬ 
cantly smaller number of fc-points. The interpolation is 
done by using a shortest-path solver to determine a cor¬ 
respondence between two fc-adjacent curves. The corre¬ 
spondence is then used to find intermediate curves which 
can be used to determine the averaged transmission. The 
proposed interpolation scheme consists of three separate 
steps, which will be described in detail in the following 
sections. 

In order to show the validity of our proposed scheme 
we have determined the transmission through pristine 
graphene for increasing number of transverse k -points 
(see Fig. 00 - The computational details are given in 
Sec. |III| We compare N = 6,8,..., 54 to a well-converged 


calculation with N = 600 fc-points. By applying our al¬ 
gorithm we significantly reduce the mean absolute de¬ 
viation from a fully converged transmission, as shown 
in Fig. E>- We see that the interpolated transmissions 
converge much quicker than the raw data. Fitting data 
with a power law, we can estimate the amount of k -points 
needed in order to obtain a deviation of less than one per¬ 
cent, thus giving a speed-up factor of ry = 220/41 = 5.37. 


A. Transform of data 

In the following we will outline the method in general 
terms and denote the data points by (x,y), correspond¬ 
ing to the transmission function data point, (E,Tk(E)), 
for given fc-point and energy in the concrete examples. 
Initially, we transform the set of data points (x, y) into 
points with a maximum Euclidian distance t. The new 
data points span line segments (when connected) on the 
curve C consisting of N different individual points. In 
practice the transformation of data is done by looping 
through all segment lengths L and inserting additional 
data points if L > l. Figure [2 ]a shows data where the 
distance between points a, c is much larger than £, and 
thus requires extra points between points a, c. The opti¬ 
mal value of the segment length t depends on the given 
data. A value close to the median of the original point 
distance is a good starting guess, and is chosen as the 
default value. 

The two curves in Fig. 0* are required to smoothly 
interpolate into each other - a process which can be 
estimated by hand (gray arrows). The proper path is 
found by overall minimization of the interpolation dis¬ 
tance, which will described in the following subsection. 
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B. Correspondence between curves 

Given two data sets, i. e. the curves £±,£ 2 , we con¬ 
struct a matrix V containing the weighted Euclidean dis¬ 
tances between points on opposite curves, 



where i = 1... N \, j = 1... N 2 denote all points on the 
curves £\,£ 2 , respectively. Thus, the dimension of the 
matrix V is N\ x N 2 . The weights w XiV are added to 
ensure a degree of tunability when interpolating. This 
is due to the fact that x and y are different quantities 
and thus have different scales. In order to interpolate 
one curve into the other we need to determine the short¬ 
est path for all points on either curve. This is done by 
considering the distance matrix as a landscape and mov¬ 
ing from the start (1,1) to the destination {N\, N 2 ) using 
the smallest cost possible. This is essentially identical to 
finding the minimum energy path on an potential energy 
surface, which can be done using either string methods or 
the nudged elastic band approach 1 - 1 . We will instead use 
Dijkstra’s algorithm, which loops through coordinates in 
the distance landscape and iteratively compares all found 
paths until the target point is found. Each iteration is 
done while keeping track of the cost and path. For more 
information see Ref. SI The coordinates along the re¬ 
turned shortest path is saved in a correspondence matrix 
C containing two columns (Ci,C 2 ), which is used in the fi¬ 
nal step of our routine. The distance matrix D in Fig. §3 
shows the shortest path as a white line propagating along 
the distance landscape minimum (black/dark blue). 


C. Interpolation of curves 

Once the data is transformed and a correspondence 
matrix is obtained we interpolate the curves £\,£ 2 . In¬ 
stead of a simple linear interpolation we use the corre¬ 
spondence matrix to obtain intermediate curves. A sim¬ 
ple linear, discretized interpolation is used, i.e., 

xi 2 a = olx 1 [Ci] + (1 - a)x 2 [C 2 \ (2) 

yi 2 a = ayi [Cl] + (1 - a)y 2 [C 2 ] , (3) 

where a is the interpolation parameter between zero 
(curve £ 2 ) and one (curve £ 1 ), and Xi[M] means that 
we use the indices M of the vector aThe found inter¬ 
mediate data sets {xi 2a ,yi 2 a) can then be sampled back 
to the original grid. In practice we loop over a to gen¬ 
erate the different interpolated curves. Figure [2]c shows 
the data in Fig. [2^, interpolated into N = 10 intermediate 
curves and transformed back onto the original grid. 

We note that the choice of weights w x ,w y depends on 
the input data scales. Changing the values can highly 
affect the outcome of the shortest-path solver, since the 
distance landscape is changed. Usually, it is advisable to 


rescale the data (using the weights) so that the two data 
ranges are comparable. Similarly, the length l has to be 
chosen wisely: A large value can result in crude interpo¬ 
lations while a too small value makes the algorithm too 
time-consuming. 

Finally, we note that the algorithm described in the 
previous subsections allows us to interpolate data in gen¬ 
eral. We can apply the algorithm specifically to trans¬ 
missions and DOS by providing it with the needed data. 
In the case of TranSiesta and the utility TBTrans 
we need to weight the interpolated curves using fc-grid 
weights and sum to obtain average transmission curves. 
In the case of 3D transport we have a 2D fc-grid, which 
can be investigated using bilinear interpolation. In the 
following section we apply the interpolation scheme to 
example cases. 


III. EXAMPLE CASES 

The usefulness of the presented algorithm is show¬ 
cased by considering transmission calculations through 
the simulated nano-systems shown in Fig. [3j (a) a pris¬ 
tine graphene sheet, (b) a graphene nano-constrictioiP, 
and (c) hydrogenated kinked graphene 11 . The shown 
structures have minimal unit-cells in the transverse trans¬ 
mission direction due to periodic boundary conditions, 
and carbon atoms are shown in black while hydrogen 
atoms are shown in white. 

The transmission through the graphene sheet in Fig. §1 
is calculated with the TranSiesta simulation suite, 
which utilizes a localized basis set. For the present work 
a DZP basis set is used in conjunction with a mesh cut¬ 
off of 300 Ry, a force tolerance of 0.02 eV/A, as well as 
a Monkhorst-Pack grid of 24 x 5 ensuring absolute con¬ 
vergence. Exchange and correlation is described using 
the PBE GGA functionaP. A minimal transverse unit¬ 
cell is used due to periodic boundary conditions, while 
an energy window of ±10 eV around the Fermi energy 
is considered both for a crude transverse transmission k- 
grid (Nk = 20) and a fully converged /c-grid (Nk = 600). 
The coarse transmission spectrum is interpolated using 
the presented interpolation scheme, as can be seen in the 
upper row in Fig. [4] The upper window of each col¬ 
umn describes the full transmission versus fc-point. A 
good agreement is seen between the interpolated and 
the converged data sets. A few places the interpolation 
guesses incorrectly (for instance around E = —4 eV for 
k = 0.25f). These occur since the input data is too 
coarse. However, despite the few differences between in¬ 
terpolated data and converged data, the averaged curves 
have a remarkable resemblance and the mean deviation 
is below 2%, as seen in Fig. [TJo. 

The transmission through the graphene nano¬ 
constriction and kinked graphene shown in Figs. i> and 
[3J: h ave been extracted from the original datasets (see 
RefsP-hd). and are shown in the middle and lower win¬ 
dows in Fig. [4] As with the pristine graphene example we 
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(a) Graphene sheet 
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(b) Graphene nano-constriction 



(c) Kinked graphene 


Figure 3. (Colour online) Structures used as example cases: 
(a) a graphene sheet consisting entirely of carbon atoms, (b) 
a graphene hydrogen-passivated nano-constriction, and (c) a 
kinked graphene sheet decorated with hydrogen along the 
kink lines. The left and right electrodes have been high¬ 
lighted with blue and red, respectively. Minimal unit-cells 
in the transverse transmission direction have been used due 
to the periodicity. Carbon atoms are shown in black, while 
hydrogen atoms are shown in white. 


graphene sheet, respectively. Thus, we have demon¬ 
strated that by applying a simple post-processing inter¬ 
polation scheme we can speed up convergence of roughly 
an order of magnitude. 

IV. COMMENTS AND CONCLUSION 

We have presented a simple post-processing interpola¬ 
tion scheme which speeds up transmission calculations 
on nano-structured materials. The algorithm uses a 
shortest-path solver to determine the optimal interpo¬ 
lation of a set of fc-dependent transmission curves, which 
ultimately can be summed to obtain a smoothed average 
transmission. 

We note that as a post-processing tool the algorithm 
relies on the quality of the original data. Since this data 
is used as a base for the interpolation any fluctuations 
will be present in the interpolated data and thus propa¬ 
gate to the final result. The present implementation of 
the shortest-path solver is based on Dijkstra’s algorithm 
which is stable but very slow 3 . A far quicker implemen¬ 
tation would be to use a heuristic to guide the shortest- 
path search in the distance landscape, thus changing the 
solver to an industry standard algorithm known as an 
A*-searctP. However, due to the complexity of the in¬ 
put data the construction of such a heuristic is not a 
straight-forward task. 

By considering three sample cases we have demon¬ 
strated that it can speed up calculations by roughly an 
order of magnitude. We have illustrated the method us¬ 
ing electron transport through graphene nano-structures 
and fc-point averaging of transmission functions. How¬ 
ever, the method is generally applicable also to phonon 
transport and to other functions such as density of states 
or other types of interpolation parameters such as elec¬ 
trostatic gating etc. 

Our interpolation scheme can easily be implemented 
in already existing code. We provide a sample MatLAB 
code online that can read and interpolate data obtained 
from TranSiesta and TBTrans 1 } which are built on- 
top of the ab-initio software package Siesta®!. 


see a remarkable resemblance to the converged data set. 
The discrepancies are negligible, which stems from the 
fact that a finer fc-point sampling has been performed in 
the original data. In the three cases we obtain speed-ups 
of approximately 5, 6, and 8, for the pristine graphene 
sheet, the graphene nano-constriction, and the kinked 
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Figure 4. (Colour online) Algorithm applied to raw transmission data for the graphene-based systems in Fig. [3] The raw data 
(left column) is interpolated (middle column) and compared to the fully converged transmission curve (right column). The 
upper window of each subfigure is the full transmission for each fc-value, while the lower window shows the averaged value. 
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